Abstract. We prove that there are $\varepsilon > 0$ and $p_0 > 0$ such that for every prime $p > p_0$, every subset $S$ of $\mathbb{Z}/p\mathbb{Z}$ which satisfies $|2S| \le (2+\varepsilon)|S|$ and $2|2S| - 2|S| + 3 \le p$ is contained in an arithmetic progression of length $|2S| - |S| + 1$. This is the first result of this nature which places no unnecessary restrictions on the size of $S$.



1. Introduction 

In 1959 Freiman [2] proved that if $S$ is a set of integers such that

$$|2S| \le 3|S| - 4$$

then $S$ is contained in an arithmetic progression of length $|2S| - |S| + 1$.
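As an informal sanity check of this statement, the following Python sketch brute-forces every subset of a small integer interval. The helper names (`sumset`, `ap_length`, `check_3k4`) and the search range are illustrative assumptions, not part of the paper; the check only confirms the statement on tiny instances and proves nothing.

```python
from functools import reduce
from itertools import combinations
from math import gcd


def sumset(S):
    """Return the sumset 2S = S + S of a set of integers."""
    return {a + b for a in S for b in S}


def ap_length(S):
    """Length of the shortest arithmetic progression of integers containing S."""
    s = sorted(S)
    if len(s) == 1:
        return 1
    # The best common difference is the gcd of all differences to the minimum.
    d = reduce(gcd, (x - s[0] for x in s[1:]))
    return (s[-1] - s[0]) // d + 1


def check_3k4(n):
    """Check the (3k - 4) statement on all subsets of {0, ..., n-1} of size >= 3."""
    for k in range(3, n + 1):
        for S in combinations(range(n), k):
            doubling = len(sumset(S))
            if doubling <= 3 * k - 4 and ap_length(S) > doubling - k + 1:
                return False  # would be a counterexample
    return True
```

For instance, `check_3k4(10)` inspects all subsets of `{0, ..., 9}` with at least three elements.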

This result is often known as Freiman's $(3k-4)$ Theorem. It has been conjectured that the same result also holds in the finite groups $\mathbb{Z}/p\mathbb{Z}$ of prime order. Working towards this conjecture, Freiman [2] proved (see Nathanson [13] for the following formulation of the result):

Theorem 1 (Freiman [2]). Let $S \subset \mathbb{Z}/p\mathbb{Z}$ be such that $3 \le |S| \le c_0 p$ and

$$|2S| \le c_1|S| - 3,$$

with $0 < c_0 \le 1/12$, $c_1 > 2$ and $(2c_1 - 3)/3 < (1 - c_0 c_1)/c_1^{1/2}$. Then $S$ is contained in an arithmetic progression of length $|2S| - |S| + 1$.

The largest possible numerical value of $c_1$ given by this theorem is $c_1 \approx 2.45$, which falls somewhat short of the value predicted by the conjecture (namely 3). In addition, Theorem 1 only guarantees the result for sets $S$ that are small enough. For example, to guarantee $c_1 = 2.4$, the theorem needs the assumption $|S| < p/35$. This last assumption was improved to $|S| < p/10.7$ by Rødseth [14], but without improving the value of the constant $c_1$.

It follows from a recent result of Green and Ruzsa [1] on rectification of sets with small doubling in $\mathbb{Z}/p\mathbb{Z}$ that the value of $c_1$ can actually be pushed all the way to 3 while preserving the conclusion that $S$ is contained in a short arithmetic progression, but this comes at the expense of a stringent condition on the size of $S$, namely the extra assumption $|S| < 10^{-180} p$.

In the present paper, we shall work at the conjecture from a different direction. Rather than focusing on the best possible value for the constant $c_1$, we shall try to lift all restrictions on the size of $S$. First we need to formulate properly what should be the right version of Freiman's $(3k-4)$ theorem in $\mathbb{Z}/p\mathbb{Z}$.

For $-1 \le m \le |S| - 4$, we want the condition $|2S| = 2|S| + m$ to imply that $S$ is included in an arithmetic progression of length $|S| + m + 1$. One fact that has not been spelt out explicitly in the literature is that for such a result to hold, some lower bound on the size of the complement $(\mathbb{Z}/p\mathbb{Z}) \setminus 2S$ of $2S$ must be formulated. Indeed, if $p - |2S|$ is too small, the conclusion will not hold even if $m$ is small compared to $|S| - 4$. Consider in particular the following example. Let $S = \{0\} \cup \{m+3, m+4, \ldots, (p+1)/2\}$. We have $|2S| = p - (m+1) = 2|S| + m$, but it can be seen with a little thought that $S$ is not included in an arithmetic progression of length $|S| + m + 1$. For the desired result to hold, we must therefore add the condition $p - |2S| > m + 1$. We conjecture that this extra condition is sufficient for a $\mathbb{Z}/p\mathbb{Z}$-version of Freiman's $(3k-4)$ theorem to hold. More precisely:

Conjecture 2. Let $S \subset \mathbb{Z}/p\mathbb{Z}$ and let $m = |2S| - 2|S|$. Suppose that $m$ satisfies:

$$-1 \le m \le \min\{|S| - 4,\; p - |2S| - 3\}.$$

Then $S$ is included in an arithmetic progression of length $|S| + m + 1$.

Note that $p - |2S| = p - 2|S| - m$ cannot be equal to $m + 2$, otherwise $p$ would be an even number. Therefore the condition $m \le p - |2S| - 3$ of the conjecture is equivalent to $p - |2S| > m + 1$, as suggested by the example above.
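The example $S = \{0\} \cup \{m+3, \ldots, (p+1)/2\}$ can be checked numerically for small parameters. The sketch below is only an illustration: the function names (`sumset_mod`, `ell_mod`, `example_set`) and the choices $p = 23$, $m = 1$ are assumptions made here, not taken from the paper.

```python
def sumset_mod(X, Y, p):
    """Sumset X + Y computed in Z/pZ."""
    return {(x + y) % p for x in X for y in Y}


def ell_mod(X, p):
    """Length of the shortest arithmetic progression of Z/pZ containing X (p prime)."""
    best = p
    for d in range(1, (p - 1) // 2 + 1):
        inv = pow(d, -1, p)  # modular inverse (Python 3.8+)
        ys = sorted((x * inv) % p for x in X)
        # Cyclic gaps between consecutive points after rescaling the difference to 1.
        gaps = [b - a for a, b in zip(ys, ys[1:])] + [ys[0] + p - ys[-1]]
        best = min(best, p - max(gaps) + 1)
    return best


def example_set(p, m):
    """The set {0} U {m+3, ..., (p+1)/2} from the text."""
    return {0} | set(range(m + 3, (p + 1) // 2 + 1))
```

With `p, m = 23, 1` the set has 10 elements, its doubling has `p - (m + 1) = 21` elements, and `ell_mod` returns 13, which exceeds `|S| + m + 1 = 12`.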

We remark that the cases $m = -1, 0, 1$ of this conjecture are known. They are implied by Vosper's theorem [18] ($m = -1$), by a result of Hamidoune and Rødseth [9] ($m = 0$) and by a result of Hamidoune and the present authors [10] ($m = 1$). In the present paper we shall prove Conjecture 2 for all values of $m$ up to $\varepsilon|S|$, where $\varepsilon$ is a fixed absolute constant. More precisely, our main result is:

Theorem 3. There exist positive numbers $p_0$ and $\varepsilon$ such that, for all primes $p > p_0$, any subset $S$ of $\mathbb{Z}/p\mathbb{Z}$ such that

(i) $|2S| \le (2 + \varepsilon)|S|$,

(ii) $m = |2S| - 2|S|$ satisfies $m \le \min\{|S| - 4,\; p - |2S| - 3\}$,

is included in an arithmetic progression of length $|S| + m + 1$.

We shall prove this result with the numerical values $\varepsilon = 10^{-4}$ and $p_0 = 2^{94}$.

In the past, the dominant strategy, already present in Freiman's original proof of Theorem 1, has been to rectify the set $S$, i.e. to find an argument that enables one to claim that the sum $S + S$ must behave as in $\mathbb{Z}$, and then apply Freiman's $(3k-4)$ theorem. Rectifying $S$ directly, however, becomes more and more difficult as the size of $S$ grows, hence the different upper bounds on $|S|$ that one regularly encounters in the literature. In our case, without any upper bound on $|S|$, rectifying $S$ by studying its structure directly is a difficult challenge. Our method will be indirect. Our strategy is to use an auxiliary set $A$ that minimizes the difference $|S + A| - |A|$ among all sets such that $|A| \ge m + 3$. The set $A$ is called an $(m+3)$-atom of $S$, and using such sets to derive properties of $S$ is an instance of the isoperimetric (or atomic) method in additive number theory, which was introduced by Hamidoune and developed in [6, 7, 8, 10, 11]. The point of introducing the set $A$ is that we shall manage to prove that it is both significantly smaller than $S$ and also has a small sumset $2A$. This will enable us to show that first the sum $A + A$, and then the sum $S + A$, must behave as in $\mathbb{Z}$. Finally we will use Lev and Smelianski's distinct set version [12] of Freiman's $(3k-4)$ Theorem to conclude.

The paper is organised as follows. The next section will introduce $k$-atoms and their properties that are relevant to our purposes. In Section 3 we will show how our method works by proving Theorem 3 in the relatively easy case when $m$ is an arbitrary constant or a slowly growing function of $p$ (i.e. $\log p$). In Section 4 we will prove Theorem 3 in full when $m$ is a linear function of $|S|$.

2. Atoms 

Let $S$ be a subset of $\mathbb{Z}/p\mathbb{Z}$ such that $0 \in S$. For a positive integer $k$, we shall say that $S$ is $k$-separable if there exists $X \subset \mathbb{Z}/p\mathbb{Z}$ such that $|X| \ge k$ and $|X + S| \le p - k$.

Suppose that $S$ is $k$-separable. The $k$-th isoperimetric number of $S$ is then defined by

$$\kappa_k(S) = \min\{\, |X + S| - |X| \;:\; X \subset \mathbb{Z}/p\mathbb{Z},\ |X| \ge k \text{ and } |X + S| \le p - k \,\}. \tag{1}$$

For a $k$-separable set $S$, a subset $X$ achieving the above minimum is called a $k$-fragment of $S$. A $k$-fragment with minimal cardinality is called a $k$-atom.
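To make the definitions of $k$-fragment and $k$-atom concrete, here is a brute-force sketch that enumerates all candidate sets $X$ for a very small prime. It is exponential in $p$ and is meant only as an illustration; the function name `isoperimetric` and the test instance ($S = \{0, 1, 2\}$ in $\mathbb{Z}/11\mathbb{Z}$, $k = 2$) are assumptions of this example.

```python
from itertools import combinations


def isoperimetric(S, p, k):
    """Brute-force kappa_k(S) and the list of k-atoms of S in Z/pZ (tiny p only)."""
    best, fragments = None, []
    # |X + S| >= |X| and |X + S| <= p - k force k <= |X| <= p - k.
    for size in range(k, p - k + 1):
        for X in combinations(range(p), size):
            XS = {(x + s) % p for x in X for s in S}
            if len(XS) > p - k:
                continue  # X is not admissible in definition (1)
            value = len(XS) - size
            if best is None or value < best:
                best, fragments = value, [set(X)]
            elif value == best:
                fragments.append(set(X))
    if best is None:
        return None, []  # S is not k-separable
    smallest = min(len(F) for F in fragments)
    atoms = [F for F in fragments if len(F) == smallest]
    return best, atoms
```

For the interval `{0, 1, 2}` modulo 11 and `k = 2`, the search returns the value 2, and the atoms are the 11 pairs of consecutive residues, in line with the Cauchy-Davenport bound.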

What makes $k$-atoms interesting objects is the following lemma:

Lemma 4 (The intersection property [6]). Let $S$ be a subset of $\mathbb{Z}/p\mathbb{Z}$ such that $0 \in S$, and suppose $S$ is $k$-separable. Let $A$ be a $k$-atom of $S$. Let $F$ be a $k$-fragment of $S$ such that $A \not\subset F$. Then $|A \cap F| \le k - 1$.

The following Lemma is proved in [11]:

Lemma 5. Let $S \subset \mathbb{Z}/p\mathbb{Z}$ with $|S| \ge 3$ and $0 \in S$. Suppose $S$ is 2-separable and $\kappa_2(S) \le |S| + m$. Let $A$ be a 2-atom of $S$. Then $|A| \le m + 3$.

Lemma 5 implies the following upper bound on the size of atoms.

Lemma 6. Let $k \ge 3$ and let $A$ be a $k$-atom of a $k$-separable set $S \subset \mathbb{Z}/p\mathbb{Z}$ with $0 \in S$, $|S| > 2$ and $\kappa_k(S) \le |S| + m$. Then $|A| \le 2m + k + 2$.

Proof. The set $A$ is clearly 2-separable. Let $B$ be a 2-atom of $A$ with $0 \in B$, so that $|B + A| \le |B| + |A| + m$. Let $b \in B$, $b \ne 0$. By Lemma 5 we have $|B| \le m + 3$. Therefore,

$$|A \cup (b + A)| = |\{0, b\} + A| \le |B + A| \le |A| + 2m + 3. \tag{2}$$

But $b + A$ is also a $k$-atom of $S$. By the intersection property, it follows that $|A \cap (b + A)| \le k - 1$. Hence $2|A| - (k - 1) \le |A \cup (b + A)|$, which together with (2) gives the result. □

From now on $S$ will refer to a subset of $\mathbb{Z}/p\mathbb{Z}$ satisfying conditions (i) and (ii) of Theorem 3 for a fixed $\varepsilon > 0$ to be determined later, and $m$ always denotes the integer $m = |2S| - 2|S|$. Without loss of generality we will also assume $0 \in S$.

Note that condition (ii) implies that $S$ is $(m+3)$-separable, so that $(m+3)$-atoms of $S$ exist. Note that by the definition of an atom, if $X$ is an atom of $S$ then so is $x + X$ for any $x \in \mathbb{Z}/p\mathbb{Z}$. Therefore there are atoms containing the zero element.

In the sequel $A$ will denote an $(m+3)$-atom of $S$ with $0 \in A$. We will regularly call upon the following two inequalities:

$$|S + A| \le |S| + |A| + m \tag{3}$$

which follows from the definition of an atom, and

$$|A| \le 3m + 5, \tag{4}$$

which follows from Lemma 6 with $k = m + 3$.

The reader should also bear in mind that for all practical purposes, inequality (4) means that we will only be dealing with cases when $|A|$ is significantly smaller than $|S|$. Indeed, we shall prove Theorem 3 for a small value of $\varepsilon$, namely $\varepsilon = 10^{-4}$, so that $3m$ is very much smaller than $|S|$. We can also freely assume that $|S| > p/35$, since otherwise Freiman's Theorem 1 gives the result with $\varepsilon = 0.4$. The prime $p$ will also be assumed to be larger than some fixed value $p_0$ to be determined later.

3. The case $m < \log p$

In this section we will deal with the case when $m$ is a very small quantity, i.e. smaller than a logarithmic function of $p$. This will allow us to introduce, without technical difficulties to hinder us, the general idea of the method, which is to first show that $A$ must be contained in a short arithmetic progression and then to transfer the structure of $A$ to the larger set $S$. It will also serve the additional purpose of allowing us to suppose $m \ge 6$ when we switch to the looser condition $m \le \varepsilon|S|$.

We start by stating some results that we shall call upon. The first is a generalization of Freiman's theorem in $\mathbb{Z}$ to sums of different sets and is proved by Lev and Smelianski in [12]. We give it here somewhat reworded (see also [17, Th. 5.12]).

Theorem 7 (Lev and Smelianski [12]). Let $X$ and $Y$ be two nonempty finite sets of integers with

$$|X + Y| = |X| + |Y| + \mu.$$

Assume that $\mu \le \min\{|X|, |Y|\} - 3$ and that one of the two sets $X, Y$ has size at least $\mu + 4$. Then $X$ is contained in an arithmetic progression of length $|X| + \mu + 1$ and $Y$ is contained in an arithmetic progression of length $|Y| + \mu + 1$.

The second result we shall use is due to Bilu, Lev and Ruzsa [1, Theorem 3.1]¹ and gives a bound on the length of small sets in $\mathbb{Z}/p\mathbb{Z}$. By the length $\ell(X)$ of a set $X \subset \mathbb{Z}/p\mathbb{Z}$ we mean the length (cardinality) of the shortest arithmetic progression which contains $X$.

Theorem 8 (Bilu, Lev, Ruzsa [1]). Let $X \subset \mathbb{Z}/p\mathbb{Z}$ with $|X| \le \log_4 p$. Then $\ell(X) < p/2$.

Theorem 8 will be used to show that, when $m$ is small enough, the atom $A$ is contained in a short arithmetic progression.

Lemma 9. Suppose that $6m + 11 \le \log_4 p$. Then $A$ is contained in an arithmetic progression of length $2(|A| - 1)$.

Proof. Since we assume $|S| > p/35$, it follows from (3) and (4) that $A$ is an $(m+4)$-separable set. Let therefore $B$ be an $(m+4)$-atom of $A$ containing 0, so that $|B + A| \le |B| + |A| + m$. By Lemma 6 we have $|B| \le 3m + 6$, so that $|A \cup B| \le 6m + 11$. By the present lemma's hypothesis, it follows from Theorem 8 that $A \cup B$ is contained in an arithmetic progression of length less than $p/2$. The sum $A + B$ can therefore be considered as a sum of integers, so that Theorem 7 applies and $A$ is contained in an arithmetic progression of length $|A| + m + 1 \le 2|A| - 2$. □

¹ In [1] their statement is slightly different from Theorem 8, but this is actually what they prove.

We now proceed to deduce from Lemma 9 the structure of $S$. It will be convenient to introduce the following notation.

Recall that we denote by $\ell(X)$ the length of the smallest arithmetic progression containing $X$. By $\ell_X(Y)$ we shall denote the length of a smallest arithmetic progression of difference $x$ containing $Y$, where $x$ is the difference of a shortest arithmetic progression containing $X$.

The point of the above definition is that if we have $\ell_A(S) + \ell(A) \le p$ then the sum $S + A$ can be considered as a sum in $\mathbb{Z}$, so that (3) and Theorem 7 applied to $S$ and $A$ imply Theorem 3. We summarize this point in the next Lemma for future reference.

Lemma 10. If we can assume $\ell_A(S) + \ell(A) \le p$ then Theorem 3 holds.

Whenever we wish to transfer the structure of $A$ to $S$ we will assume that $\ell_A(S) + \ell(A) > p$ and look for a contradiction. We can think of this hypothesis as $S$ having no 'holes' of length $\ell(A)$. In the present case of very small $m$, the desired result on $S$ follows with very little effort.

Lemma 11. Suppose that $6m + 11 \le \log_4 p$. Then $S$ is contained in an arithmetic progression of length $|S| + m + 1$.

Proof. By Lemma 9, $A$ is contained in an arithmetic progression of difference $r$, which we can assume to equal $r = 1$, and of length $2(|A| - 1)$. In particular $A$ has two consecutive elements. Without loss of generality we may replace $A$ by a translate of $A$ and assume that $\{0, 1\} \subset A$. Let $S = S_1 \cup \cdots \cup S_k$ be the decomposition of $S$ into maximal arithmetic progressions of difference one, so that

$$|S + A| \ge |S| + k.$$

Because of (3) we have $k \le |A| + m$. By Lemma 10 we can assume every maximal arithmetic progression in the complement of $S$ to have length at most $\ell(A)$. Therefore,

$$\ell_A(S) + \ell(A) \le |S| + k\,\ell(A) \le |S| + (|A| + m) \cdot 2(|A| - 1).$$

Now by (4) we get

$$\ell_A(S) + \ell(A) \le |S| + (4m + 5)(6m + 8) \le |S| + (\log_4 p)^2 < \frac{p}{2} + (\log_4 p)^2$$

since $|S| < p/2$. We have $(\log_4 p)^2 < p/2$ for all $p$, therefore we get $\ell_A(S) + \ell(A) < p$, a contradiction. □

4. The general case 

4.1. Overview. When $m$ grows we encounter two difficulties. First, Theorem 8 will not apply anymore to any set containing $A$, and we need an alternative method to argue that $A$ is contained in a short arithmetic progression. Second, even if we do manage to prove that $A$ is contained in a short arithmetic progression, we will not be able to deduce the structure of $S$ from (3) by the simple technique of the preceding section.

We will now use an extra tool, namely the Plünnecke-Ruzsa estimates for sumsets; see e.g. [15], [13].

Theorem 12 (Plünnecke-Ruzsa [15]). Let $S$ and $T$ be finite subsets of an abelian group with $|S + T| \le c|S|$, and let $j$ be a positive integer. There is a nonempty subset $S' \subset S$ such that

$$|S' + jT| \le c^j |S'|.$$

The Plünnecke-Ruzsa inequalities applied to $S$ and $A$ will give us that there exists a positive $\delta$ such that either $A$ is contained in a progression of length $(2 - \delta)(|A| - 1)$ or $2A$ is contained in an arithmetic progression of length $(2 - \delta)(|2A| - 1)$ (Lemma 15). We will then proceed to transfer the structure of $A$ or $2A$ to $S$.

Again we shall use Lemma 10 to assume that $S$ does not contain a "gap" of length $\ell(A)$ or $\ell(2A)$.

We define the density of a set $X \subset \mathbb{Z}/p\mathbb{Z}$ as $\rho(X) = (|X| - 1)/\ell(X)$. If $\ell(A) \le (2 - \delta)(|A| - 1)$ we will argue that the sum $S + A$ must have a density at least that of $A$ and get a contradiction with the upper bound on $|S + A|$. The details will be given in Subsection 4.3.

We will not be quite done however, because we cannot guarantee that $\ell(A) \le (2 - \delta)(|A| - 1)$ holds. In that case we have to fall back on the condition $\ell(2A) \le (2 - \delta)(|2A| - 1)$, meaning that it is the set $2A$, rather than $A$, that has large enough density. In this case we have to work a little harder. We proceed in two steps: we first apply the Plünnecke-Ruzsa inequalities again to show that there exists a large subset $T$ of $S$ such that $|T + 2A|$ is small. We then apply the density argument to show that $T$ must be contained in an arithmetic progression with few missing elements. We then focus on the remaining elements of $S$, i.e. the set $S \setminus T$. We will again argue that if this set has a gap of length $\ell(A)$ the desired result holds, and otherwise the density argument will give us that $S + A$ is too large. This analysis is detailed in Subsection 4.4 and will conclude our proof of Theorem 3.
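The Plünnecke-Ruzsa statement used in this section can be observed on small instances by exhaustive search. The sketch below takes $c = |S + T|/|S|$ and looks for a nonempty $S' \subset S$ with $|S' + jT| \le c^j|S'|$; the function names and the sample sets are assumptions chosen for illustration only.

```python
from fractions import Fraction
from itertools import combinations


def add_mod(X, Y, p):
    """Sumset X + Y in Z/pZ."""
    return {(x + y) % p for x in X for y in Y}


def plunnecke_witness(S, T, p, j):
    """Search for a nonempty S' in S with |S' + jT| <= c^j |S'|, c = |S + T| / |S|."""
    c = Fraction(len(add_mod(S, T, p)), len(S))
    jT = {0}
    for _ in range(j):
        jT = add_mod(jT, T, p)  # after the loop, jT is the j-fold sumset of T
    for size in range(1, len(S) + 1):
        for sub in combinations(sorted(S), size):
            if len(add_mod(sub, jT, p)) <= c**j * size:
                return set(sub)
    return None  # not expected to happen, by the theorem
```

Exact rational arithmetic (`Fraction`) is used so that the comparison with $c^j|S'|$ is not affected by rounding.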



4.2. Structure of A. 



Lemma 13. Suppose $6 \le m \le \varepsilon|S|$ with $\varepsilon \le 10^{-4}$. Then for any positive integer $k \le 32$ we have

$$|kA| \le k(|A| + m)\left(1 + \frac{5k\varepsilon}{2}\right) + 1.$$

Proof. Rewrite (3) as

$$|S + A| \le c|S|$$

with $c = 1 + \frac{|A| + m}{|S|}$. By Theorem 12 (Plünnecke-Ruzsa), for each $k$ there is a subset $S' = S'(k)$ of $S$ such that

$$|S' + kA| \le c^k|S'|. \tag{5}$$

Apply (4) and $m \ge 6$ to get $|A| \le 3m + 5 < 4m$. Since $m \le \varepsilon|S|$ we obtain for the constant $c$ just defined $c \le 1 + 5\varepsilon$. We clearly have

$$c^k|S'| \le c^k|S| \le (1 + 5\varepsilon)^k|S| < 2|S| < p$$

for $k \le 32$. Now apply the Cauchy-Davenport Theorem to $S' + kA$ in (5) to obtain $|S'| + |kA| - 1 \le c^k|S'|$, from which

$$|kA| \le (c^k - 1)|S'| + 1 \le (c^k - 1)|S| + 1. \tag{6}$$

Numerical computations give that

$$(1 + x)^k \le 1 + kx + \frac{k^2}{2}x^2$$

for any positive real number $x \le 5 \cdot 10^{-4}$ and for $k \le 32$. Hence, since $c = 1 + (|A| + m)/|S| \le 1 + 5\varepsilon$, we can write, for $k \le 32$,

$$c^k = \left(1 + \frac{|A| + m}{|S|}\right)^k \le 1 + k\,\frac{|A| + m}{|S|} + \frac{k^2}{2}\left(\frac{|A| + m}{|S|}\right)^2.$$

Applied to (6) we get

$$\begin{aligned}
|kA| &\le k(|A| + m) + \frac{k^2}{2}\,\frac{(|A| + m)^2}{|S|} + 1 \\
&\le k(|A| + m)\left(1 + \frac{k}{2}\,\frac{|A| + m}{|S|}\right) + 1 \\
&\le k(|A| + m)\left(1 + \frac{5k\varepsilon}{2}\right) + 1,
\end{aligned}$$

as claimed. □
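The numerical bound $(1+x)^k \le 1 + kx + \frac{k^2}{2}x^2$ invoked in this proof can be spot-checked with exact rational arithmetic. The following sketch samples the range $0 < x \le 5 \cdot 10^{-4}$ on a grid; the grid size and the function names are assumptions made for this illustration, and a grid check is of course not a proof.

```python
from fractions import Fraction


def bound_holds(x, k):
    """Exact check of (1 + x)^k <= 1 + k x + (k^2 / 2) x^2 for rational x."""
    return (1 + x) ** k <= 1 + k * x + Fraction(k * k, 2) * x * x


def check_grid(x_max=Fraction(5, 10**4), steps=50, k_max=32):
    """Check the bound on an evenly spaced grid of x values and all 1 <= k <= k_max."""
    xs = [x_max * i / steps for i in range(1, steps + 1)]
    return all(bound_holds(x, k) for x in xs for k in range(1, k_max + 1))
```

The bound is specific to small $x$: for example `bound_holds(Fraction(1), 32)` is false, since $2^{32}$ is far larger than $1 + 32 + 512$.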

Lemma 14. If $6 \le m \le \varepsilon|S|$ with $\varepsilon \le 10^{-4}$, then $A$ and $2A$ are contained in an arithmetic progression of length less than $p/2$.

Proof. Put $k = 2^j$ and $c_1 = 2.44$. Suppose that $|2^jA| > c_1|2^{j-1}A| - 3$ for each $1 \le j \le 5$. Then,

$$|32A| > c_1^5|A| - 3(c_1^5 - 1)/(c_1 - 1) > 86|A| - 179 \ge 65|A| + 10,$$

where in the last inequality we have used $|A| \ge m + 3 \ge 9$. On the other hand, by Lemma 13 we have

$$|kA| \le k(|A| + m)\left(1 + \frac{5k\varepsilon}{2}\right) + 1 \le 2k\left(1 + \frac{5k\varepsilon}{2}\right)|A|, \tag{7}$$

which, for $k = 32$, gives $|32A| \le 64(1 + 80\varepsilon)|A| < 65|A|$, a contradiction.

Hence $|2^jA| \le c_1|2^{j-1}A| - 3$ for some $1 \le j \le 5$. We have

$$|2^{j-1}A| \le |16A| \le 32(1 + 40\varepsilon)|A| \le 64(1 + 40\varepsilon)\varepsilon p < 8 \cdot 10^{-3}p,$$

where again we have used inequality (7) for $k = 16$ and $|A| \le 4m \le 4\varepsilon|S| \le 2\varepsilon p$. It follows from Freiman's Theorem 1 (with $c_0 = 8 \cdot 10^{-3}$ and $c_1 = 2.44$) that $A \subset 2^{j-1}A$ is contained in an arithmetic progression of length at most

$$|2^jA| - |2^{j-1}A| + 1 \le 1.44\,|2^{j-1}A| < (1.44) \cdot 8 \cdot 10^{-3}p.$$

In particular, $A$ and $2A$ are included in arithmetic progressions of lengths less than $p/2$. □
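The numerical constants in this proof are easy to recompute. The sketch below evaluates them for $\varepsilon = 10^{-4}$, $c_0 = 8 \cdot 10^{-3}$ and $c_1 = 2.44$, including the side condition $(2c_1 - 3)/3 < (1 - c_0c_1)/c_1^{1/2}$ of Theorem 1. The variable names are illustrative assumptions; exact fractions are used except where a square root is needed.

```python
from fractions import Fraction
from math import sqrt

EPS = Fraction(1, 10**4)   # epsilon = 10^-4
C0 = Fraction(8, 10**3)    # c_0 = 8 * 10^-3
C1 = Fraction(244, 100)    # c_1 = 2.44

growth = C1**5                          # coefficient of |A| in the lower bound on |32A|
loss = 3 * (C1**5 - 1) / (C1 - 1)       # constant subtracted in that lower bound
upper_32 = 64 * (1 + 80 * EPS)          # coefficient in (7) for k = 32
upper_16 = 64 * (1 + 40 * EPS) * EPS    # bound on |16A| / p

# Side condition of Theorem 1, evaluated in floating point because of the square root.
freiman_lhs = float((2 * C1 - 3) / 3)
freiman_rhs = float(1 - C0 * C1) / sqrt(float(C1))
```

With these values, `growth` exceeds 86, `loss` stays below 179, `upper_32` is below 65, and `upper_16` is below `C0`, matching the inequalities used above.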

Now that we know that $A$ and $2A$ are contained in an arithmetic progression of length smaller than $p/2$, we can apply Freiman's $(3k-4)$ Theorem to them to get the following result.

Lemma 15. Suppose $6 \le m \le \varepsilon|S|$ with $\varepsilon \le 10^{-4}$, and let $0 < \delta \le 10^{-1}$. If $A$ is not contained in an arithmetic progression of length $(2 - \delta)(|A| - 1)$ then $2A$ is contained in an arithmetic progression of length $(2 - \delta)(|2A| - 1)$.

Proof. Suppose first that $|2A| > (3 - \delta)(|A| - 1)$ and $|4A| > (3 - \delta)(|2A| - 1)$. Then

$$|4A| > (3 - \delta)^2|A| - (3 - \delta)^2 - (3 - \delta) > (3 - \delta)^2|A| - 12. \tag{8}$$

On the other hand, Lemma 13 for $k = 4$ and $\varepsilon = 10^{-4}$ gives $|4A| \le 4(1 + 10\varepsilon)(|A| + m) + 1$. By using (8) and $m \le |A| - 3$ we get

$$(3 - \delta)^2|A| - 12 < 8(1 + 10\varepsilon)|A| - 12(1 + 10\varepsilon) + 1.$$

Since $m \ge 6$, we have $|A| \ge m + 3 \ge 9$. Therefore we obtain

$$(3 - \delta)^2|A| < \left(8(1 + 10\varepsilon) + \frac{1}{9}\right)|A|,$$

a contradiction for $\delta \le 0.1$.

Hence,

(a) either $|2A| \le (3 - \delta)(|A| - 1) < 3|A| - 3$, but since $\ell(A) < p/2$ by Lemma 14, Freiman's $(3k-4)$ Theorem applies and $A$ is contained in an arithmetic progression of length $|2A| - (|A| - 1) \le (2 - \delta)(|A| - 1)$;

(b) or $|4A| \le (3 - \delta)(|2A| - 1) < 3|2A| - 3$, but using Lemma 14 again, Freiman's $(3k-4)$ Theorem implies that $2A$ is contained in an arithmetic progression of length $(2 - \delta)(|2A| - 1)$.

□
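The contradiction in the proof of Lemma 15 rests on comparing $(3 - \delta)^2$ with $8(1 + 10\varepsilon) + 1/9$. A short exact computation, with a hypothetical helper name chosen here for illustration, shows the size of the margin at the extreme allowed value $\delta = 0.1$.

```python
from fractions import Fraction


def contradiction_margin(delta, eps):
    """(3 - delta)^2 - (8 (1 + 10 eps) + 1/9); a positive value means the bounds clash."""
    return (3 - delta) ** 2 - (8 * (1 + 10 * eps) + Fraction(1, 9))
```

The margin decreases as $\delta$ grows, so checking $\delta = 1/10$ covers the whole range $0 < \delta \le 10^{-1}$ for the given $\varepsilon$.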
4.3. Structure of $S$ when $\ell(A)$ is small. For a subset $B \subset \mathbb{Z}/p\mathbb{Z}$ define the density of $B$ by

$$\rho(B) = \frac{|B| - 1}{\ell(B)}.$$

The next lemma gives a lower bound for the cardinality of a sumset of two subsets $B, C \subset \mathbb{Z}/p\mathbb{Z}$ when $\ell(B) + \ell(C) > p$ in terms of their densities.

Lemma 16. Let $0 \in C \subset \mathbb{Z}/p\mathbb{Z}$ with $C \subset [0, \ell(C))$ and $\ell(C) < p/2$. Let $I_1, \ldots, I_{2t}$ be the sequence of intervals defined by $I_i = [(i-1)c, ic)$, where $c = \ell(C)$ and $t \le p/(2c)$. Let $B \subset \mathbb{Z}/p\mathbb{Z}$ be such that for every $i = 1, \ldots, 2t$, we have $I_i \cap B \ne \emptyset$. Then,

$$|B + C| \ge |B \cup [(B + C) \cap I]| \ge |B| + \left(t - \frac{1}{2}\right)\ell(C)\left(\rho(C) - \frac{|B \cap I|}{(2t - 1)c}\right),$$

where $I = I_1 \cup \ldots \cup I_{2t}$.

Proof. Let $B' = B \cap I$. Let $B_0^i = B' \cap I_{2i-1}$ and $B_1^i = B' \cap I_{2i}$, and define $B'_0 = \bigcup_{i=1}^{t} B_0^i$, $B'_1 = \bigcup_{i=1}^{t} B_1^i$, so that $B' = B'_0 \cup B'_1$. Note that, since $C \subset [0, c)$,

$$(B_0^i + C) \cap (B_0^j + C) = \emptyset$$

for $i \ne j$ and that $B_0^i + C \subset I_{2i-1} \cup I_{2i}$. Therefore $B'_0 + C$ can be written as the following union of disjoint sets:

$$B'_0 + C = \bigcup_{i=1}^{t}(B_0^i + C) \subset I_1 \cup \ldots \cup I_{2t}.$$

Hence, since every set $B_0^i$ is nonempty, the Cauchy-Davenport Theorem implies

$$|B'_0 + C| \ge |B'_0| + t(|C| - 1). \tag{9}$$

In a similar manner we have

$$\begin{aligned}
(B'_1 + C) \cap I &= \bigcup_{i=1}^{t-1}(B_1^i + C) \cup \big((B_1^t + C) \cap I\big) \\
&\supset \bigcup_{i=1}^{t-1}(B_1^i + C) \cup B_1^t
\end{aligned}$$

so that, applying the Cauchy-Davenport Theorem for $i = 1, \ldots, t-1$, we get

$$|(B'_1 + C) \cap I| \ge |B'_1| + (t - 1)(|C| - 1). \tag{10}$$

Now we have $|B + C| \ge |B \setminus B'| + |(B'_0 + C) \cap I|$ and likewise $|B + C| \ge |B \setminus B'| + |(B'_1 + C) \cap I|$, hence, applying (9) and (10),

$$\begin{aligned}
|B + C| &\ge |B \setminus B'| + \frac{1}{2}\big(|(B'_0 + C) \cap I| + |(B'_1 + C) \cap I|\big) \\
&\ge |B| - |B'|/2 + \left(t - \frac{1}{2}\right)(|C| - 1) \\
&= |B| + \left(t - \frac{1}{2}\right)c\left(\rho(C) - \frac{|B'|}{(2t - 1)c}\right),
\end{aligned}$$

which proves the result. □
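The inequality of Lemma 16 can be exercised on random instances. Since $(t - \tfrac12)\,c\,\rho(C) = (t - \tfrac12)(|C| - 1)$ when $c = \ell(C)$, the bound reads $|B + C| \ge |B| + (t - \tfrac12)(|C| - 1) - |B \cap I|/2$. The sketch below tests this form; it is an assumption of the example that $c$ is simply any integer with $C \subset [0, c)$ and $2tc \le p$ (the proof only uses this containment), and all names and parameters are illustrative.

```python
import random


def check_lemma16(p, c, t, trials=200, seed=0):
    """Random test of |B + C| >= |B| + (t - 1/2)(|C| - 1) - |B ∩ I| / 2 in Z/pZ."""
    assert 2 * c < p and 2 * t * c <= p
    rng = random.Random(seed)
    intervals = [range(i * c, (i + 1) * c) for i in range(2 * t)]
    for _ in range(trials):
        # C lies in [0, c) and contains 0; c plays the role of l(C) here.
        C = {0, c - 1} | {x for x in range(c) if rng.random() < 0.5}
        # B meets every interval I_i and may also have points outside I.
        B = {rng.choice(iv) for iv in intervals}
        B |= {x for x in range(p) if rng.random() < 0.3}
        total = {(b + x) % p for b in B for x in C}
        in_I = sum(1 for b in B if b < 2 * t * c)
        # Both sides are multiplied by 2 to stay in integer arithmetic.
        if 2 * len(total) < 2 * len(B) + (2 * t - 1) * (len(C) - 1) - in_I:
            return False
    return True
```

A call such as `check_lemma16(101, 7, 7)` uses 14 intervals of length 7 inside $\mathbb{Z}/101\mathbb{Z}$.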

Lemma 16 allows us to conclude the proof when the $(m+3)$-atom $A$ is contained in a short arithmetic progression.

Lemma 17. Suppose $6 \le m \le \varepsilon|S|$ with $\varepsilon \le 10^{-4}$. Suppose furthermore that $\ell(A) \le (2 - \delta)(|A| - 1)$. Then $\ell(S) \le |S| + m + 1$.

Proof. Set $a = \ell(A)$. Write $p = 2ta + r$, $0 \le r < 2a$, and let $I_1, \ldots, I_i, \ldots, I_{2t}$ be the partition of $[0, 2ta)$ into the intervals $I_i = [(i-1)a, ia)$ and $I = \bigcup_{i=1}^{2t} I_i$. Let $S' = S \cap I$.

Suppose that $\ell_A(S) + \ell(A) > p$. Then we have $I_i \cap S' \ne \emptyset$ for each $i = 1, \ldots, 2t$. By Lemma 16 with $B = S$ and $C = A$,

$$|S + A| \ge |S| + \left(t - \frac{1}{2}\right)a\left(\rho(A) - \frac{|S'|}{(2t - 1)a}\right). \tag{11}$$

Now we have $(2t - 1)a > p - 3a$ by definition of $t$. Since $|A| \le 3m + 5$ we have $a = \ell(A) \le 2(|A| - 1) \le 6m + 8$, and since we have supposed $m \ge 6$, we get $a \le 8m$. We therefore have

$$(2t - 1)a > p - 3a \ge p - 24m \ge (1 - 12\varepsilon)p. \tag{12}$$

By the hypothesis of the Lemma we have $\rho(A) \ge 1/(2 - \delta)$. Together with (12) we get, writing $|S'| \le |S| < p/2$,

$$\rho(A) - \frac{|S'|}{(2t - 1)a} \ge \frac{1}{2 - \delta} - \frac{1}{2 - 24\varepsilon}.$$

Finally, applying again (12), inequality (11) becomes

$$|S + A| \ge |S| + \frac{p}{2}(1 - 12\varepsilon)\left(\frac{1}{2 - \delta} - \frac{1}{2 - 24\varepsilon}\right). \tag{13}$$

Now recall that by definition of $A$ we have $|A| \ge m + 3$. We will therefore get that (13) contradicts (3) whenever the righthand side of (13) is greater than $|S| + 2|A|$. Since $|A| \le 3m + 5 < 4m \le 2\varepsilon p$, a contradiction is obtained whenever

$$\frac{1}{2}(1 - 12\varepsilon)\left(\frac{1}{2 - \delta} - \frac{1}{2 - 24\varepsilon}\right) \ge 4\varepsilon. \tag{14}$$

For $\varepsilon \le 10^{-4}$ the inequality (14) is verified for every $\delta \ge 5 \cdot 10^{-3}$. Since Lemma 15 allows us to choose $\delta$ up to the value $10^{-1}$, the hypothesis $\ell_A(S) + \ell(A) > p$ cannot hold, so that the result follows from Lemma 10. □
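To get a feeling for the size of the margin in this last step, one can compare the lower bound on $(|S + A| - |S|)/p$ coming from (13) with the upper bound $2|A|/p \le 4\varepsilon$. The sketch below does this in floating point for $\varepsilon = 10^{-4}$ and $\delta = 10^{-1}$, the value of $\delta$ that Lemma 15 permits; the function name is a hypothetical helper introduced here.

```python
def lemma17_margin(delta, eps):
    """Lower bound on (|S + A| - |S|) / p taken from (13), minus the upper bound 4 * eps."""
    gain = 0.5 * (1 - 12 * eps) * (1 / (2 - delta) - 1 / (2 - 24 * eps))
    return gain - 4 * eps
```

A positive value signals the contradiction sought in the proof; for $\delta$ close to 0 the margin becomes negative, which is why a lower bound on $\delta$ is needed.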

4.4. Structure of $S$ when $\ell(2A)$ is small. To conclude the proof of Theorem 3 it remains to consider the case where $\ell(A) > (2 - \delta)(|A| - 1)$. We break up the proof into several lemmas.

Lemma 18. Suppose $6 \le m \le \varepsilon|S|$ with $\varepsilon \le 10^{-4}$. Suppose furthermore that $\ell(A) > (2 - \delta)(|A| - 1)$. Then

(i) $|2A| > (3 - \delta)(|A| - 1)$.

(ii) $\ell(A) \le (1 - \delta/2)|2A|$.

Proof. By point (a) of the final argument in the proof of Lemma 15 we know that we cannot have $|2A| \le (3 - \delta)(|A| - 1)$. This proves (i).

Since $A$ is contained in an arithmetic progression of length less than $p/2$ (Lemma 14) we have $\ell(A) \le (\ell(2A) + 1)/2$. Now Lemma 15 implies $\ell(2A) \le (2 - \delta)(|2A| - 1)$, hence $(\ell(2A) + 1)/2 \le (1 - \delta/2)|2A|$. This proves (ii). □

Next we apply the Plünnecke-Ruzsa inequalities to exhibit a subset $T$ of $S$ that sums to a small sumset with $2A$. We then show that this set $T$ must be contained in an arithmetic progression with few missing elements.

Lemma 19. Suppose $6 \le m \le \varepsilon|S|$ with $\varepsilon \le 10^{-4}$. Suppose furthermore that $\ell(A) > (2 - \delta)(|A| - 1)$. Then there exists $T \subset S$ such that, denoting $\lambda = |T|/|S|$,

$$|2A| \le \lambda(4 + 10\varepsilon)(|A| - 1), \tag{15}$$

$$\ell(T) \le |T| + 2\ell(A). \tag{16}$$

Proof. By Theorem 12 and (3), there is $T \subset S$ such that

$$|T + 2A| \le \left(1 + \frac{|A| + m}{|S|}\right)^2|T| = |T| + 2(|A| + m)\frac{|T|}{|S|} + (|A| + m)^2\frac{|T|}{|S|^2}.$$

Writing $|A| + m \le 3m + 5 + m < 5m \le 5\varepsilon|S|$ and $\lambda = |T|/|S|$ we get

$$|T + 2A| \le |T| + \lambda(|A| + m)(2 + 5\varepsilon) < p. \tag{17}$$

Now apply the Cauchy-Davenport Theorem $|T + 2A| \ge |T| + |2A| - 1$ in (17) to get, since $|A| \ge m + 3$,

$$|2A| - 1 \le \lambda(2|A| - 3)(2 + 5\varepsilon),$$

and

$$|2A| \le 2\lambda(2 + 5\varepsilon)(|A| - 1) - \lambda(2 + 5\varepsilon) + 1. \tag{18}$$

Notice that if $\lambda(2 + 5\varepsilon) < 1$ then (18) gives $|2A| < 2(|A| - 1) + 1$, which contradicts the Cauchy-Davenport Theorem. Therefore we have $1 - \lambda(2 + 5\varepsilon) \le 0$ and (18) yields (15).

In the remaining part we prove (16). Recall that the hypothesis of the present lemma together with Lemma 15 imply

$$\ell(2A) \le (2 - \delta)(|2A| - 1). \tag{19}$$

Suppose first that

$$\ell_{2A}(T) + \ell(2A) > p. \tag{20}$$

Set $a_2 = \ell(2A)$ and $p = 2ta_2 + r$ with $0 \le r < 2a_2$. Let $I = I_1 \cup \cdots \cup I_{2t}$ with $I_i = [(i-1)a_2, ia_2)$. By (20) we have $T \cap I_i \ne \emptyset$ for each $i = 1, \ldots, 2t$. By Lemma 16 with $B = T$ and $C = 2A$,

$$|T + 2A| \ge |T| + \left(t - \frac{1}{2}\right)a_2\left(\rho(2A) - \frac{|T'|}{(2t - 1)a_2}\right), \tag{21}$$

where $T' = T \cap I$. By (19) we have $a_2 < 2|2A|$, so that by using (15) and $\lambda \le 1$ we obtain the following rough upper bound

$$a_2 < (8 + 20\varepsilon)|A| < 9(3m + 5) < 36m,$$

where we have used $\varepsilon < 1/20$.

As in the proof of Lemma 17 we have, by definition of $t$,

$$(2t - 1)a_2 > p - 3a_2 > p - 108m \ge p(1 - 54\varepsilon), \tag{22}$$

so that, writing $|T'| \le |T| \le |S| < p/2$, and applying (19) we have

$$\rho(2A) - \frac{|T'|}{(2t - 1)a_2} \ge \frac{1}{2 - \delta} - \frac{1}{2 - 108\varepsilon}.$$

Applying again (22), inequality (21) becomes

$$|T + 2A| \ge |T| + \frac{p}{2}(1 - 54\varepsilon)\left(\frac{1}{2 - \delta} - \frac{1}{2 - 108\varepsilon}\right). \tag{23}$$

On the other hand, (17) implies

$$|T + 2A| \le |T| + 10m + 25\varepsilon m \le |T| + p(5\varepsilon + 25\varepsilon^2/2),$$

which together with (23) gives

$$5\varepsilon + 25\varepsilon^2/2 \ge \frac{1}{2}(1 - 54\varepsilon)\left(\frac{1}{2 - \delta} - \frac{1}{2 - 108\varepsilon}\right). \tag{24}$$

For $\varepsilon = 10^{-4}$ the inequality (24) fails to hold for each $\delta \ge 2 \cdot 10^{-2}$. Since (19) holds for every $\delta \le 10^{-1}$, the hypothesis (20) cannot hold, so that the sumset $T + 2A$ behaves like a sum of integers. Let us write

$$|T + 2A| = |T| + |2A| + \mu$$

and check that the conditions of Theorem 7 hold. By Lemma 18 (i) we have

$$\begin{aligned}
|2A| &\ge (3 - \delta)(|A| - 1) \\
&\ge (2 + 5\varepsilon)|A| + (1 - \delta - 5\varepsilon)|A| - 3 \\
&\ge (2 + 5\varepsilon)|A| + \frac{3}{2}
\end{aligned}$$

since $m \ge 6$ and $|A| \ge m + 3 \ge 9$. Therefore

$$\begin{aligned}
2|2A| &\ge 2(2 + 5\varepsilon)|A| + 3 \\
&\ge (2 + 5\varepsilon)(|A| + m) + 3,
\end{aligned}$$

which, since $\mu \le (|A| + m)(2 + 5\varepsilon) - |2A|$ by (17), leads to

$$|2A| \ge \mu + 3. \tag{25}$$

Now by definition of $\lambda$ we have $|T| = \lambda|S|$ and we also have $|S| \ge 11\varepsilon|S|$, so that

$$\begin{aligned}
|T| &\ge 11\lambda\varepsilon|S| \ge 11\lambda m \\
&\ge \lambda(2 + 5\varepsilon)5m \ge \lambda(2 + 5\varepsilon)(|A| + m)
\end{aligned}$$

and, since $\mu \le \lambda(|A| + m)(2 + 5\varepsilon) - |2A|$ by (17), we obtain

$$|T| \ge \mu + |2A| \ge \mu + 4. \tag{26}$$

Inequalities (25) and (26) mean that Theorem 7 applies and we have:

$$\ell(T) \le |T| + \mu + 1 \le |T| + |2A| \le |T| + \ell(2A) \le |T| + 2\ell(A).$$

This proves (16) and concludes the lemma. □
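The failure of inequality (24) for the stated range of $\delta$ can be confirmed by direct evaluation. The following sketch, with a hypothetical function name, compares the two sides in floating point for $\varepsilon = 10^{-4}$.

```python
def inequality_24_holds(delta, eps):
    """Evaluate 5 eps + 25 eps^2 / 2 >= (1/2)(1 - 54 eps)(1/(2 - delta) - 1/(2 - 108 eps))."""
    lhs = 5 * eps + 25 * eps**2 / 2
    rhs = 0.5 * (1 - 54 * eps) * (1 / (2 - delta) - 1 / (2 - 108 * eps))
    return lhs >= rhs
```

At $\delta = 2 \cdot 10^{-2}$ the right-hand side is roughly twice the left-hand side, so the inequality fails with room to spare, and the gap widens as $\delta$ increases to $10^{-1}$.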

Lemma 20. Suppose $6 \le m \le \varepsilon|S|$ with $\varepsilon \le 10^{-4}$. Suppose furthermore that $\ell(A) > (2 - \delta)(|A| - 1)$. Then $\ell(S) \le |S| + m + 1$.

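Proof. Let $T$ be the set guaranteed by Lemma 19. Let $\overline{T} = S \setminus T$, which belongs to an interval of length $p - \ell(T)$. Set $a = \ell(A)$. Let us apply again Lemma 16, this time with $B = S$, $C = A$, and $t$ defined so as to have $p - \ell(T) = 2ta + r$, $0 \le r < 2a$. As before, set $I = I_1 \cup \cdots \cup I_{2t}$ with $I_i = [(i-1)a, ia)$. Note that $T \cap I = \emptyset$, so that $\overline{T} \cap I = S \cap I$. Let us first suppose

$$\ell_A(S) + \ell(A) > p, \tag{27}$$

which implies $\overline{T} \cap I_i \ne \emptyset$ for every $i = 1, \ldots, 2t$, so that by Lemma 16 and denoting $\overline{T}' = \overline{T} \cap I = S \cap I$,

$$\begin{aligned}
|S + A| &\ge |S \cup [(S + A) \cap I]| \\
&\ge |S| + \left(t - \frac{1}{2}\right)a\left(\rho(A) - \frac{|\overline{T}'|}{(2t - 1)a}\right).
\end{aligned} \tag{28}$$

By definition of $t$ and by (16) we have

$$(2t - 1)a > p - \ell(T) - 3a \ge p - |T| - 5a. \tag{29}$$

Now Lemma 18 (ii) and (15) give the following upper bound on $a$:

$$a \le |2A| \le \lambda(4 + 10\varepsilon)|A| \le \lambda(4 + 10\varepsilon)4m \le \lambda(4 + 10\varepsilon)2\varepsilon p,$$

so that we can write $-5a \ge -\lambda f(\varepsilon)p$ with $f(\varepsilon) = 10(4 + 10\varepsilon)\varepsilon$. Writing $|T| = \lambda|S| < \lambda p/2$, (29) becomes

$$(2t - 1)a > p\left(1 - \lambda\left(\tfrac{1}{2} + f(\varepsilon)\right)\right). \tag{30}$$

Next we write $|\overline{T}'| \le |\overline{T}| = |S| - |T| = (1 - \lambda)|S|$, so that $|S| < p/2$ gives

$$|\overline{T}'| < \frac{p}{2}(1 - \lambda). \tag{31}$$

Finally we bound $\rho(A)$ from below. Apply again Lemma 18 (ii) and (15) to get

$$\ell(A) \le (1 - \delta/2)|2A| \le (1 - \delta/2)\lambda(4 + 10\varepsilon)(|A| - 1),$$

so that we have

$$\rho(A) \ge \frac{1}{\lambda(1 - \delta/2)(4 + 10\varepsilon)}. \tag{32}$$

Applying (30), (31) and (32) to (28) now gives

$$|S + A| \ge |S| + \frac{p}{2}\left(\frac{1 - \lambda\left(\tfrac{1}{2} + f(\varepsilon)\right)}{\lambda(1 - \delta/2)(4 + 10\varepsilon)} - \frac{1}{2}(1 - \lambda)\right).$$

Together with (3), writing $|A| \le 4m$ and $m \le \varepsilon p/2$, we obtain

$$\frac{1 - \lambda\left(\tfrac{1}{2} + f(\varepsilon)\right)}{\lambda(1 - \delta/2)(4 + 10\varepsilon)} - \frac{1}{2}(1 - \lambda) - 5\varepsilon \le 0. \tag{33}$$

Now there exists $\varepsilon_\delta > 5.8 \cdot 10^{-3} > 0$ such that for every $\varepsilon \le \varepsilon_\delta$, the lefthand side of (33) is strictly positive for every value of $\lambda \in [0, 1]$. In that case (33) cannot hold and we obtain a contradiction with the hypothesis (27). Therefore Theorem 7 implies the result. □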
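The positivity of the lefthand side of (33) over the whole range of $\lambda$ can be explored numerically. The sketch below evaluates the expression on a grid of values $\lambda \in (0, 1]$ for given $\delta$ and $\varepsilon$; the function names and the grid resolution are assumptions of this illustration, and the grid scan is only indicative.

```python
def lhs_33(lam, delta, eps):
    """Lefthand side of (33) as a function of lambda, delta and epsilon."""
    f = 10 * (4 + 10 * eps) * eps
    main = (1 - lam * (0.5 + f)) / (lam * (1 - delta / 2) * (4 + 10 * eps))
    return main - 0.5 * (1 - lam) - 5 * eps


def min_lhs_33(delta, eps, steps=1000):
    """Minimum of the lefthand side of (33) over lambda = 1/steps, 2/steps, ..., 1."""
    return min(lhs_33(i / steps, delta, eps) for i in range(1, steps + 1))
```

For $\varepsilon = 10^{-4}$ and $\delta = 10^{-1}$ the minimum over the grid is about 0.09, attained near $\lambda \approx 0.73$, so the expression stays well away from zero.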

Numerical values: As has been shown in the proofs, Theorem 3 holds with $\varepsilon = 10^{-4}$. As for the value of $p_0$, we use $m \ge 6$ in Section 4, so in order to cover smaller values of $m$, the prime $p$ should satisfy the condition $\log_4 p \ge 6m + 11$ of Lemma 11 for every $m \le 6$. This holds as soon as $\log_4 p \ge 47$, which is equivalent to $p \ge 2^{94}$. We have tried to strike a balance between readability and obtaining the best possible constants. These values of $\varepsilon$ and $p_0$ are not the best possible, but they give a reasonable account of what can be achieved through the methods of this paper.